Automatic pitch marking and reconstruction of glottal closure instants from noisy and deformed electro-glotto-graph signals
نویسندگان
چکیده
Pitch tracking and pitch marking (PM) are two important speech signal analysis techniques for several applications. The accuracy of both pitch marking and tracking is significant to generate smooth synthesized speech by controlling the pitch and duration of voiced speech in Text-to-Speech (TTS) system for example. In this paper, we present a novel hybrid approach, combining electro-glotto-graph (EGG)-based PM and speech signal-based PM into a single framework, to acquire more reliable and automatic PM technique. Experimental results show that the PM performance of the suggested method is excellent being capable of determining Glottal Closure Instants (GCI) precisely even in the case of noisy EGG signals.
منابع مشابه
An automatic pitch-marking method using wavelet transform
This paper describes a new automatic pitch-marking method using wavelet transform. This method detects discontinuity in the speech waveform which occurs at the glottal closure instant (GCI). A time domain prosodic modification technique requires an appropriate determination of the synthesis pitch-marks. We evaluated the performance of the newly developed pitchmarking method by using our interna...
متن کاملMaximum a posteriori pitch tracking
A Maximum a posteriori framework for computing pitch tracks as well as voicing decisions is presented. The proposed algorithm consists of creating a time-pitch energy distribution based on predictable energy that improves on the normalized cross-correlation. A large database is used to evaluate the algorithm’s performance against two standard solutions, using glottal closure instants (GCI) obta...
متن کاملProsodic manipulation using instants of significant excitation
This paper proposes a technique for prosodic (pitch and duration) manipulation using instants of significant excitation. Instants of significant excitation correspond to the instants of glottal closure (epochs) in voiced speech and to some random excitations like burst onset in the case of nonvoiced speech. Instants of significant excitation are computed from the average group delay of minimum ...
متن کاملClassification-Based Detection of Glottal Closure Instants from Speech Signals
In this paper a classification-based method for the automatic detection of glottal closure instants (GCIs) from the speech signal is proposed. Peaks in the speech waveforms are taken as candidates for GCI placements. A classification framework is used to train a classification model and to classify whether or not a peak corresponds to the GCI. We show that the detection accuracy in terms of F1 ...
متن کاملExploring Bessel Features for Detection of Glottal Closure Instants
For voiced speech, the most significant excitation takes place around the instant of glottal closure. Glottal closure instants (GCI) information is useful for accurate speech analysis. In particular accurate spectrum analysis is performed by considering the speech in the intervals of glottal closure. In this paper we propose an approach for detection of GCI by exploring Bessel feature, and the ...
متن کامل